Automatic methods for lexical stress assignment and syllabification
Abstract
Improvements in automatic lexical stress assignment and syllabification can increase the quality of text-to-speech synthesis and reduce the memory requirements for dictionaries. Several methods were evaluated. Machine-learning-based methods are preferred because they adapt easily to multiple languages. For stress prediction, encouraging results were obtained by combining a decision tree approach with an algorithm that uses global (word-level) statistical data derived from the training dictionary. For syllable boundary prediction, algorithms that learn syllable-level statistics from the training dictionary perform very well and can be implemented as a post-process after prediction of phoneme transcription and stress.
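As a rough illustration of what "learning syllable-level statistics from the training dictionary" could look like, here is a minimal Python sketch. It is not the method evaluated in the paper: the toy vowel set, the function names `learn_stats` and `syllabify`, and the add-one scoring of onset and coda counts are all assumptions made for this example. It runs on an already predicted phoneme sequence, in the spirit of the post-processing step described above.

```python
from collections import Counter

# Assumption: a toy phoneme inventory where these symbols are syllable nuclei.
VOWELS = {"a", "e", "i", "o", "u"}

def learn_stats(training):
    """Count the onsets and codas seen in a syllabified training dictionary.

    `training` is a list of words, each a list of syllables,
    each syllable a list of phoneme symbols.
    """
    onsets, codas = Counter(), Counter()
    for syllables in training:
        for syl in syllables:
            idx = [i for i, p in enumerate(syl) if p in VOWELS]
            if not idx:
                continue  # skip syllables without a nucleus in this toy model
            onsets[tuple(syl[:idx[0]])] += 1
            codas[tuple(syl[idx[-1] + 1:])] += 1
    return onsets, codas

def syllabify(phones, onsets, codas):
    """Place one boundary between each pair of consecutive nuclei.

    Each consonant cluster is split into coda + onset at the point that
    maximises (coda count + 1) * (onset count + 1) from the training data.
    """
    nuclei = [i for i, p in enumerate(phones) if p in VOWELS]
    if not nuclei:
        return [list(phones)]
    boundaries = []
    for left, right in zip(nuclei, nuclei[1:]):
        cluster = phones[left + 1:right]
        best_k = max(
            range(len(cluster) + 1),
            key=lambda k: (codas[tuple(cluster[:k])] + 1)
            * (onsets[tuple(cluster[k:])] + 1),
        )
        boundaries.append(left + 1 + best_k)
    out, start = [], 0
    for b in boundaries:
        out.append(list(phones[start:b]))
        start = b
    out.append(list(phones[start:]))
    return out

# Hypothetical training dictionary: pa.ta, sta, an.ta, kas
training = [
    [["p", "a"], ["t", "a"]],
    [["s", "t", "a"]],
    [["a", "n"], ["t", "a"]],
    [["k", "a", "s"]],
]
onsets, codas = learn_stats(training)
print(syllabify(list("kanta"), onsets, codas))
print(syllabify(list("pasta"), onsets, codas))
```

With this toy data, the cluster "nt" is split as n.t because "n" was seen as a coda and "t" as an onset, while "st" stays together because "st" was seen as an onset. A real system would need a proper phoneme inventory and far more training data, and might also condition on stress.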
Similar resources
Automatic word stress marking and syllabification for Catalan TTS
Stress and syllabification are essential attributes for several components in text-to-speech (TTS) systems. They are responsible for improving grapheme-to-phoneme conversion rules and for enhancing the synthetic intelligibility, since stress and syllable are key units in prosody prediction. This paper presents three linguistically rule-based automatic algorithms for Catalan text-to-speech conve...
A Semi-Automatic System for the Syllabification and Stress Assignment
This Master's Thesis concerns research in the automatic analysis of the sub-lexical structure of English words. Sub-lexical structure includes linguistic categories such as syllabification, stress, phonemic representation, phonetics, and spelling. This information could be very useful in all sorts of speech applications, including duration modeling and speech recognition. ANGIE is a system that...
Phonological Processing for Urdu Text to Speech System
Determining and modeling phonological phenomena is necessary to generate speech from textual input. These phenomena include letter to sound conversion, syllabification, sound change, stress assignment and intonation assignment. This paper presents work on Urdu phonological processes and provides algorithms to convert textual input into phonologically annotated output, required for Urdu text-to-...
Linguistic-prosodic processing for text-to-speech synthesis in Italian
The linguistic-prosodic processing applied to text-to-speech synthesis in Italian is described. It proceeds in 5 steps: tokenisation and normalisation of abbreviations, numbers, etc.; part-of-speech tagging, based on function words, terminations and contextual heuristics; shallow parsing, based on a chunk grammar; grapheme-to-phoneme conversion, lexical stress assignment and syllabification by ...
Automatic syllabification in English: a comparison of different algorithms.
Automatic syllabification of words is challenging, not least because the syllable is not easy to define precisely. Consequently, no accepted standard algorithm for automatic syllabification exists. There are two broad approaches: rule-based and data-driven. The rule-based method effectively embodies some theoretical position regarding the syllable, whereas the data-driven paradigm tries to infe...